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Abstract. We introduce a new test procedure of independence in the framework of para- 
metric copulas with unknown marginals. The method is based essentially on the dual rep- 
resentation of x 2_ divergence on signed finite measures. The asymptotic properties of the 
proposed estimate and the test statistic are studied under null and alternative hypothesis, 
with simple and standard limit distributions both when the parameter is an interior point 
or not. 

1 Introduction and motivations 



Parametric models for copulas have been intensively investigated during the last decades. 
Copula have become popular in applied statistics, because of the fact that they constitute 
a flexible and robust way to model dependence between the margins of random vectors. In 
this framework, semiparametric inference methods, based on pseu d o-like li hood, have been 



applied to copulas by a n umber of authors (see, e.g.. lShih and Louisl (119951 ) 



fl2000h 



Wang and Ding 



Tsukaharal (120051 ) and the references therein). Throughout the available literature, 



investigations on the asymptotic properties of parametric estimators, as well as the relevant 
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test statistics, have privileged the case where the parameter is an interior point of the ad- 
missible domain. However, for most parametric copula models of interest, the boundaries of 
the admissible parameter spaces include some important parameter values, typically among 
which, that corresponding to the independence of margins. This paper concentrates on this 
specific problem. We aim, namely, to investigate parametric inference procedures, in the 
case where the parameter belongs to the boundary of the admissible domain. In particular, 
the usual limit laws both for parametric copula estimators and test statistics become invalid 
under these limiting cases, and, in particular, under marginal independence. Motivated 
by this observation, we will introduce a new semiparametric inference procedure based on 
X 2 -divergence and duality techniques. We will show that the proposed estimators remain 
asymptotically normal, even under the marginal independence assumption. This will allow 
us to introduce test statistics of independence, whose study will be made, both under the 
null and alternative hypotheses. 

It is well known since the work of ISklarl (119591 ) that the joint behavior of a bivariate vec- 



tor (Xi,X 2 ) with d.f. F{x\,x 2 ) '■= P{X\ < xi,X 2 < x 2 ), and continuous marginal d.f.'s 
Fi(xi) := P(Xi < Xi), i — 1,2, is characterized by the copula (or dependence function) C 
associated with F(-, ■). The copula function is defined, for all (u\, u 2 ) G (0, l) 2 , through the 
identity 

C(« a ,« a ) :=P{F 1 (X 1 )<u 1 ,F 2 (X 2 )<u 2 }. 



Many useful multivariate models for dependence between X\ and X 2 turn out to be generated 
by parametric families of copulas of the form \Cp\ 9 G 9|, typically i ndexed by a vector val- 
ued par amete r 6 G 9 C MP (see, e.g.. lKimeldorf and Sampson! (jl975al ). lKimeldorf and Sampson 



(1l975bh . and 



tiated by 



Deheuvels ( 



In addition^ 



fsee, also 



Joel (19931)). T he nonparametric approach to copula estimation has been ini- 



Dcheuvels ( 



979bj), who i ntroduced and investigated the empirical copula process. 



1980 



Fermanian et al. 



1981) described the limiting behavior of this empirical process 



2004al ) and the references therein). In this paper, we consider 



semiparametric copula models with unknown general margins. 
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In order to estimate the unknown true value of the parameter 9 G 0, which we denote, 
throughout the sequel, by 9 T G 6, some semiparametric estimation procedures, based on 
the maximization, on t he parameter space G, of prope rly chosen pseudo-lik e lihood criterion, 



have 



been 



( 11995( 1 and 



jroposed by 



Oakcs ( 



1994). and studied 



Wang and Ding (120001 ) . iTsukaharal (120051 ) among others. In each of these papers 



Genest et al. 



(1995) 



Shih and Louis 



some asymptotic normality properties are established for -Jn(9 — 9 T ) , where 9 = 9 n denotes 
a properly chosen estimator of 9t- This is achieved, provided that 9? lies in the interior, 
denoted by 0, of the parameter space C W. On the other hand, the case where 9t G <90 : = 
— is a boundary value of 0, has not been studied in a systematical way until present. 
Moreover, it turns out that, for the above-mentioned estimators, the asymptotic normality 
of y/n{6 — 9t), may fail to hold for 9t G <90; indeed, under some regularity conditions, 



when 9 is univariate, we can prove that the limit law is the distribution of Zt 



(Z>0) 



where 



Z is a centred normal variable, and that the limit law of the generalized pseudo-likelihood 
ratio sta tistic is a mixture of y? law with one degree of freedom and Dirac measure at 



zero; see 



Bouzebda and Kezioul (120071 ). Furthermore, when the parameter is multivariate, 



the derivation of the li mit distributions under the null hypothesis of independence, becomes 



much more complex; see 



Self and Liang) (119871 ). Also, the limit distributions are not standard 



which yields formidable numerical difficulties to calculate the critical value of the test. We 
cite below some examples of parametric copulas, for which marginal independence is verified 
for some specific values of the parameter 9, on the boundary <90 of the admissible parameter 
set 0. We s t art w ith examples for which 9 varies within subsets of R. Such is the case for 



the 



Gumbell (jl960l ) family of copulas, which is one of the most popular model used to model 



bivariate extreme values. This family of copulas is defined, for < Ui,u 2 < 1, by 

C ( Ul ,u 2 ) := exp {- [(- log Ul )° + (- \ogu 2 ) e ] 1/9 } , for 9 G [1, oo). (1.1) 



A useful family of copulas, due to I Jod (119931 ). is given, for < Ui,u 2 < 1, by 



C e ( Ul , u 2 ) := 1 - [(1 - Ul ) e + (1 - u 2 ) e - (1 - Ul ) e (l - u 2 ) e ] 1/9 , for 9 G [1, oo). (1.2) 
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The IGalamboa (119751 ) family of copulas is given, for < U\, u% < 1, by 

Ce{u 1 ,u 2 ) := MiM 2 exp | [(-logwi)~ 9 + (-\ogu 2 )~ e ] 1/6 \ , for 9 G [0, 



OO . 



1.3) 



The 



Hiisler and Reisd (119891 ) family of copulas is defined, for < u%, u 2 < 1, by 



Ce(u u u 2 ) := u 



^(e-i + f l0g[(-l0 g M 2 )/(-l0g«l)]) d-^-i + f l0g[(-l0g Ul )/(-l0g U2 )]) 



:i-4) 



for # G [0, oo), with $(•) denoting the standard normal AT(0, 1) distribution function. 
The Gumbel-Barnett copulas are given, for < Ui,u 2 < 1, by 

Ce(rxi, m 2 ) := Wiw 2 exp{-(1 - 0)(logtt 1 )(logw 2 )} , for 0e[0, 1]. 

The Clayton copulas are such that, for < u\,u 2 < 1, 



(1.5) 



Cq^Vv) = (u x 6 



1) 



-1/0 



for 9 G [0, oo) 



;i.6) 



Parametric families of copulas with parameter 9 varying in R, p , for some p > 2, include the 
following classical examples. Below, we set 9 = (9i,9 2 ) T G R 2 . 

C e ( Ul , u 2 ) := { 1 + [{u-^ - l) e > + (V 1 " I)' 2 ] 1/92 y 1/dl , G]0, oo[x [1, oo[; (1.7) 
C e (ui,u 2 ) := exp | - 6> 2 _1 log (exp (-6» 2 (log «i) ei ) 



1.8) 



+ exp(-0 2 (log M2 ) ei ) -1 



9 G [l,oo[x]0,oo[. 



For other examples of the kind, we refer to lJod (119971 ). 

For each of the above examples, the independence case Cg T (ui,u 2 ) = u\u 2 occurs at the 
boundary of the parameter space 0, i.e., when 9 T = 1 for the models (11.11) . (II. 2p and (II. 5p . 
9t = for the models (11.31) . (11.41) and (11.61) . 9? = (0, 1) for the bivariate parameter model 
(11. 7p . and 9t = (1,0) for the bivariate parameter model (jl.8p . In the sequel, we will denote 
by 6*o the value of the parameter (when it exists), corresponding to the independence of the 
margins, i.e., the value of the parameter for which we have 



Ce (ui,u 2 ) :=uiu 2 , for all (u l7 u 2 ) G (0, l) 2 . 



Hence, O = 1 for the models (Oj) . (TT2D and (JH5J), # = for the models (Ojl . (OP and 
(Oj) . # = (0, 1) for the model (JEZJ, and # = (1, 0) for the model (Oil . 

In contrast with the preceding examples, where 9$ G 90 is a boundary value of G, the case 
where 9 is an interior point of G may, at times, occur, but is more seldom. An example 
where 9q G Q is given by the Farlie-Gumbel-Morgenstern (FGM) copula, defined by 

C e {u 1 ,u 2 ):=u x u 2 + 9u 1 u 2 {l-ui){l-u 2 ), 0G0:=[-1,1], (1.9) 

and for which 9 = G =] - 1, 1[. 

In the present article, we will treat, in turn, parametric estimation of 9t, and tests of the 
independence assumption 9? = 9 . We consider both the case where 9q G is an interior 
point of 0, and the case where 9q G <90 is a boundary value of 0. To treat this case, 
we propose a new inference procedure, based on an estimation of x 2 -divergence by duality 
technique. This method may be applied independently of the dimension of the parameter 
space. Also the limit law of the estimate of the parameter is normal and the limiting 
distribution of the proposed test statistic is \ 2 under independence, either when 9 is an 
interior point, or when 9q is a boundary point of 0, independently of the dimension of the 
parameter space. The idea is to include the parameter domain into an enlarged space, say 
e , in order to render 9q an interior point of the new parameter space, e . The conclusion is 
then obtained through an application of x 2 -divergence and duality technique. Our methods 
rely on the fact that, under appropriate assumptions, the definition of the density Cg(-, •) : = 
dl ^ du2 Co(-,-) of Cq(-, •), pertaining to the models we consider, may be extended beyond the 
"standard" domain of variation of 9. On the other hand, the definition of c#(-, •) which 
corresponds to these extensions, is then, in general, not any more a density, and may, at 
times, become negative. For example, such is the case for the parametric models (11. lj) . ( 11.2j) . 
(11. 3p and (II. 4p . for which cg(-,-) is meaningful for some 9 G" 0, but then, becomes negative 
over some non-negligible (with respect to Lebesgue's measure) subsets of (0, l) 2 . This implies 
that the log-likelihood of the data is not properly defined on the whole space O e . For this 
reason, we will use the x 2 -divergence between signed finite measures. We will discuss this 
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problem in more details, below, in section [2j 

The remainder of the present paper is organized as follows. In section [21 we present our semi- 
parametric inference procedure, based upon an optimization of the x 2 -divergence between 
the model (Cg, 9 G e ) and the empirical copula associated to the data, and by using the 
dual representation of x 2 -divergence. We then derive the asymptotic limiting distribution of 
the proposed estimator. It will become clear later on from our results, that the asymptotic 
normality of the estimate holds, even under the independence assumption, when, either, 9q 
is an interior, or a boundary point of O, independently of the dimension of the parameter 
space. The proposed test statistic of independence is also studied, under the null hypothesis 
7io of independence, as well as under the alternative hypothesis. The limiting asymptotic 
distribution of the test statistic under the alternative hypothesis is used to derive an approx- 
imation to the power function. An application of the forthcoming results will allow us to 
evaluate the sample size necessary to guarantee a pre-assigned power level, with respect to 
a specified alternative. Finally, section H] reports a short simulation results. The proofs of 
these results will be postponed to the appendix. 

2 A semi-parametric estimation procedure through % 2 - 
divergence 

As mentioned earlier, the problem of estimating 9, when 9 G dO, has not been systematically 
considered in the scientific literature; and the classical asymptotic normality property of the 
estimators is no more satisfied. To overcome this difficulty, in what follows, we enlarge 
the parameter space 9 into a wider space e D 6. This is tailored to let 6*0 become an 
interior point of G e . Naturally, we assume that the definition of the function eg may be 
extended to e . The difficulty associated with this construction is that, subject to a proper 
definition, the densities cq of Cg with respect to the Lebesgue measure, may become negative 
on some non negligible subsets of I := (0, l) 2 (in this case eg becomes the density of a signed 
measure, see remark 13721) . Note that just as Deheuvels empirical copula is not a copula, is 
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not necessarily a copula density and fail to integrate to 1. When such is the case, a semi- 
parametric estimation of 9t via log-likelihood cannot be used. To overcome this difficulty we 
introduce a new inference procedure, based on x 2 -divergence methods, and duality technique. 
Recall that the x 2 -divergence between a bounded signed measure Q, and a probability P on 
T>, when Q <C P is absolutely continuous with respect to P, is defined by 

X 2 (Q,P):= f v(¥p \ dP, where p : x G R i- <p(x) := ^ (x - i f . (2.1) 

In the sequel, we denote by x 2 ($o ; $t) the y 2 -dive rgence between Cg n and . Ap plying the 



dual representation of 0-divergence obtained by iBroniatowski and Kezioul (120061 ) Theorem 



4.4, we readily obtain that x 2 {@0j@t) can be rewritten into 

X 2 (e ,9 T ) := sup jjf / dC eo - jf ^*(/) dC 0T | , (2.2) 

where y?* is used to denote the convex conjugate of <p, namely, the function defined by 

t 2 

<f* : t G R i— > </?*(£) := sup {te — y?(a;)} = — - + t, 

a;GR 2 

and .T 7 is an arbitrary class of measurable functions, fulfilling the following condition 

V/ G T\ J \ f\ dCo is finite and <p' '(dCg / 'dCe T ) = <p'(l/co T ) G T. Furthermore, the sup in 

( 12. 2p is unique and achieved at / = ip'(l/ce T ). 

Define the new parameter space G e of 9 as follows. Set 

6 e := |# G R d such that j \ip'(l/cg(ui,u 2 ))\ du l du 2 < ooj . (2.3) 
By choosing the class of functions via 

T := {{m, u 2 )El^ ip'(l/c e ( Ul , u 2 )) - 1 ; 9 G G e } , 
we infer from (I2.2p the relation 

X 2 (9 ,9 T ) := sup { [ ( 1 - l) du x du 2 - [ (\ 1 - dC, T K,« 2 )}<2.4) 
eee e ^ Ji \c e {ui,u 2 ) J J T \2 c$(ui,u 2 y 2 J ) 
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It turns out that the supremum in (12.41) is reached iff 6 = 6t- Moreover, in general, 9t 
is an interior point of the new parameter space G e , especially under the null hypothesis of 
independence, namely, when 6 T = 9 . Set 



m(9,ui,u 2 ) 



1 ] du\du2 



1 



1 



1 

^~2 



(2.5) 



I j \c 9 {ui,u 2 ) J l 2 c e (u 1 ,u 2 y 

In what follows, we propose to estimate the x 2 -divergence x 2 (#o ; #t) between Cg and Cg T , 
by 

X 2 (#o, Ot) '■= sup / m(0,«i,u 2 ) dC n (ux,u 2 ), (2.6) 
0ee e J i 

and to estimate the parameter 9t by 



n := arg sup < / m(9,ui,u 2 ) dC n (u 1 ,u 2 ) 
eee e Ui 

where C n is the modified empirical copula, defined by 

1 " 

C n (u 1 ,U 2 ) := -^2^{F ln (X lk )<u 1 }^-{F 2n (X 2k )<u 2 }, (UX,U 2 ) & I, 
k=l 

and F jn (t) := \ Y?k=\ l]-oo,t](-Xj*), 3 = 1, 2. 



(2.7) 



(2.8) 



Remark 2.1 TTie set O e defined above is generally with non empty interior B e . In partic- 
ular, we may check that 6q (the value corresponding to independence) belongs to e , since 
the integral in A2.3\) is finite; it is equal to zero when 6 = 6 Q , for any copula density eg. 
For example, in the case of FGM copulas U.9\) . it is easy to show that B e = R. However, 
the explicit calculation of the integral in A2.3\) may be complicated for some copulas, in such 
cases we use the Monte Carlo method to compute this integral for all 9 e e . 



Statistic of the form 



V>(wi,w 2 ) dC n (u 1 ,u 2 ) 



belong to the general class of multivariate rank statistics. Their asymptotic 



been i n vestigated at 



fll972f ) 



Ruymgaart 



engt 



i by a number of authors, among whom we may cite 



properties have 



Ruymgaart et al 



(119741 ) and iRiischendorfl (J1976J). In particular, the previous authors have 



provided regularity conditions, imposed on •), which imply the asymptotic normality of 



u. 



he c orresponding arguments have been modified by IGenest et al\ (119951 ). iTsukahara 



(120051 ) and iFermanian et al\ ( l2004bl ). as to establish almost sure convergence of the estima- 
tors they consider. In the same spirit, the limiting behavior, as n — > oo, of our estimator and 
test statistic T n , will make an instrumental us e of the general t h eory o f multivariate rank 



statistics. Using some similar arguments as in iQin and Lawless! (119941 ). the existence and 
consistency of our estimator and test statistic will be established through an application of 
the law of the iterated logarithm for empirical copula processes, in combination with general 
arguments from multivariate rank statistics theory. In the sequel, without loss of generality, 
we will limit ourselves to the case where the parameter is univariate. The extensions of our 
result in a multivariate framework may be achieved, at the price of additional technicalities, 
and under similar assumptions. We will make use of the following definitions. 

Definition 2.1 (i) Let Q be the set of continuous functions q on [0, 1] which are positive 
on (0, 1), symmetric about 1/2, increasing on [0, 1/2] and satisfy J {q(t)}~ 2 dt < oo. 



(ii) A function r : (0,1) 
increasing on (0, 1/2]. 



(0, oo) is called u-shaped if it is symmetric about 1/2 and 



(Hi) For < (3 < 1 and u-shaped function r, we define 



r{(3t) if < t < 1/2; 

r{l-/3(l-t)} if 1/2 < t< 1. 

If for (3 > in a neighborhood ofO, there exists a constant Mp, such that r@ < M@r 
on (0, 1), then r is called a reproducing u-shaped function. We denote by 71 the set of 
reproducing u-shaped functions. 

Typical examples of elements in Q and 71 are given by 

q(t) = [t(l - t)] C , < C < 1/2, r(t) = g[t(l- t)P , q > 0, Q > 0. 

We will describe the asymptotic properties of the proposed estimate 6 n under the following 
conditions. 
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(C.l) There exist functions r± t k,r 2t k £ 72. such that 



de k 



m(9 T ,u 1 ,u 2 ) < ri >fc (ui)r 2 ,fc(u2), for fc = l,2, 



and 



<9 3 



m(9,Ui,u 2 ) < ti,3(mi)^2,3(w2) on a neighborhood N(9 T ) of T , 



where / 7 {ri ijfc (ui)r 2ijfc (u 2 )} 2 dC dT (ui, u 2 ) < oo for fc = 1,2,3; 

(C.2) The function w 2 ) G 7" i— > rn(9 T , U\, u 2 ) is of bounded variation on /; 

(C.3) For each 0, the function J| m(#, ui, u 2 ) : / — > R is continuously differentiable, and 
there exist functions r\ G TZ, G 7?. and G <2 for « = 1, 2, such that 
d 2 



d9du,; 



m(0,ui,u 2 ) < ri(ui)rj(uj),i,j = 1,2 and i ^ j, 



and 



J {qi(ui)ri(ui)r j (u j )}dC0 T (u 1 ,u 2 ) < oo, i,j = 1,2 and i ^ j. 



Set 



and 



where 



E := -£ 



g,m (9 T ,F 1 (X lk ),F 2 (X 2k )) 



(2.9) 



S 2 := Var m (0 T , F^), F 2 (X 2 )) + W^t, X x ) + ^ 2 (fl T , X 2 )] , (2.10) 



Wi(0 T ,JQ) := y l {Fi (x l )<„ l }^7 m ( 6l T,Mi,M 2 )c (?T (Mi,M 2 ) du x du 2 , % = 1,2. 
We can see that S and Wj(JQ) can be defined, respectively, by 



~ = £ [JX (0 T , F x (X lfc ), F 2 {X 2k ))\ 



and, for i = 1,2, 



W t (9 T ,X t 



^{Fi{Xi)<ui}-^rn (9 T ,ui,u 2 ) -§^m (9 T ,u 1 ,u 2 ) ce T (u 1 ,u 2 ) du\du 2 . 
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Theorem 2.1 Assume that the conditions (C. 1-2-3) hold. 

(a) Let B(0T,n~ 1 ^ 3 ) := {# G @ e , \9 — 9t\ < n -1 / 3 }. Then, as n — * oo, with probability one, 
the function 9 i— > J m (9,Ui,U2) dC n (ui,U2) reaches its maximum value at some point 
9 n in the interior of the interval B(9T,n~ 1 ^ 3 ). As a consequence, the estimate 9 n of 9 
is n 1//3 -consistent, and satisfies 

J jjgin (9 n , ux, u 2 ^j dC n (u 1 ,u 2 ) = 0. 

As n — > oo, 

v^(^n-^)^iV(0,S 2 /H 2 ). 
The proof of Theorem 12.11 is postponed to section [6j 



3 A test based on "x 2 -divergence" 

One of the motivations of the present work is to build a statistical test of independence, 
based on x 2 -divergence. In the framework of the parametric copula model, the null hypoth- 
esis, namely, the independence case Cg(ui,u 2 ) = uiu 2 corresponds to the condition that 
Ho : 9 T = 6> . We consider the alternative composite hypothesis Hi : 9 T ^ 9 . The 
corresponding generalized pseudo-likelihood ratio statistic is then given by 

S rg Q q\ - 2 log SUPege H^ =1 c e(Fm(X lk ), F 2n (X 2k )) 

nLi Ce (Fln(X lk ), F 2 n(X 2 k)) 

where, for j = 1, 2, Fj n stands for n/(n+l) times the marginal empirical distribution function 
of the j-th variable Xj. The rescaling by the factor n/(n + 1), avoids difficulties arising from 
potential unboundedness of logc#(ui, u 2 ), when either u\ or u 2 tends to 1. Since, #0 is a 
boundary value of the parameter space 0, we can see, that the convergence in distribution 
of S n to a x 2 random variable is not likely to hold. In order to bring a solution to this 
problem, we introduce the statistic 

Tn^oA) := 2n sup / m(9, u x , u 2 ) dC n (u x , u 2 ). (3.1) 
eee e J 1 
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Below, we will show that, under the null hypothesis Ho, the just-given statistic T n (9 ,9 n ) 
converges in distribution to a x 2 random variable. This property allows us to build a test 
of H against Hi, asymptotically of level a. The limit law of T n (9 ,9 n ) will also be given 
under the alternative hypothesis Hi. The following additional conditions will be needed for 
the statement of our results. 



(C.4) We have 



and there exist Mi > and 5± > such that, for all 9 in some neighborhood of 9o, one 
has, for % — 1, 2, 



d d 

— m{9, Mi, u 2 ) — m{9, u x , u 2 )c e (u 1 , u 2 ) 



where r(u) := u(l — u) for u G (0, 1); 



<M 1 r(^)" 1 - 5+<5l r(^3-0 O - 5+51 , 



(C.5) There exist a neighborhood N(9 T ) of 9 T) and functions r { E 1Z such that for all 9 G 
N(9t), we have 
d 



^m(6,ui,u 2 ) < ri(ui)r 2 {u 2 ) with j ' {ri{ui)r 2 (u 2 )} 2 dCg T (u 1 ,u 2 ) 
(C.6) There exist functions r i: G TZ, qi G Q % — 1, 2 such that 



< oo; 



_d_ 

dui 



m(9 T ,u 1 ,u 2 ) < ri(ui)r 2 (w 2 ), 



m{9,u 1 ,u 2 ) < ri(ui)rj(uj),i, j = 1,2 and i 7^ j, 



with 



and 



{ri(Mi)r 2 (M 2 )} 2 (iC9 T (Mi,M2) < 00, 



y dC^UuUi) < 00, i,j = 1,2 and i ^ j. 

Theorem 3.1 Assume that conditions (C. 1-2-3-4) hold. Then, under the null hypothesis 
Ho, the statistic T n converges in distribution to a x\ random variable (with 1 degree of 
freedom). 
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Theorem 3.2 Assume that conditions (C. 1-2-3) and (C.5-6) hold. Then, under the alter- 
native hypothesis Hi, we have 



v^(^-X 2 (£o,£t)) 



converges to a centered normal random variable with variance 



a\ := Var [m (9 T , F x {X lk ), F 2 {X 2k )) + Y 1 {9 T , + Y 2 {6 T , X 2 )\ 



where 



Yi(e T ,Xi) := ^t{F l (x i )<u l }^-rn(e T ,u 1 ,u 2 )c eT ( y u l ,u 2 ) du x du 2 
The proof of Theorem 13.11 is postponed until section 



(3.2) 



Remark 3.1 An application of Theorem \3.1\ leads to reject the null hypothesis Ho '■ 6t = #o? 
whenever the value of the statistic T n exceeds qi- a , namely, the (1 — a)-quantile of the law of 
the random variable \\- The test corresponding to this rejection rule is then, asymptotically 
of level a, when n — ► oo. Accordingly, the critical region is given by 



CR := {T n > gi „ Q } 



(3.3) 



The fact that this test is consistent follows from Theorem 2. 3. Further, this theorem can be 
used to give an approximation t o the p ower function 9r >— » P(0r) := Pp T {C R\ in a similar 



way to 
that 



Keziou and Leoni-Aubin {(200a) and lKeziou and Leoni-Aubin \200 r \) . We so obtain 



(3{6 T ) » 1 - $ 



^ 91 - Q -X 2 (0o,0t)) 



a 



2n 



(3.4) 



where $ denotes, as usual, the distribution function of N(0, 1) standard normal random 
variable. A useful consequence of fffjU is the possibility of computing an approximate value of 



the sample size ensuring a specified power (3 (6?) , with respect to some pre- assigned alternative 
@t 7^ ^o- Let n be the positive root of the equation 



a 



r( q -^-x 2 (0o,e T ) 



X 



2n 
13 



(3.5) 



which can be rewritten into 



{a + b) - ^a(a + 2b) 

n ° = — W55 — ' (3 - 6) 

where a := a 2 -P)) and b '■= <?i-aX 2 (#o, #t)- The sought-after approximate value 

of the sample size is then given 

n* := L^oJ + 1, 
where \u\ denote the integer part of u. 

Remark 3.2 If the parameter 9 does not belong to Q, i.e., 9 G e — 0, the densities eg, 
with respect to the Lebesgue measure, associated to Cg, may become negative on some non 
negligible subsets of I. So, in this case 



Ce(ui,u 2 ) := / c e {u ll u 2 ) d\(ui, u 2 ); Vw 6 /, (3.7) 

J(0,ui)x(0,u 2 ) 

may be a signed measure (and not a copula) for some 9 6 G e — 0. Hence, the formula \2. 7| ) 
may lead to a value of the estimator 9 n belonging to e — ; and the corresponding Cg is 
not necessarily a copula. So, for point estimation, the pseudo-maximum likelihood estimator 
(restricted to vary in the admissible domain) should be used instead of \2. 7) . The latter 
estimator may not be meaningful, and is likely to have a larger mean square error. The main 
advantage of our formula \2. 6]) . where the supremum is taken on the extended space e; 
instead of the admissible domain Q, is that it permits easily to build a test of independence 
even when the dimension of the parameter space is larger than 1 . We can show that the 
proposed test statistic, based on the formula \2. 6)) . has a x 2 (p) limit law with p = dim{Q) 
degrees of freedom. 

Remark 3.3 The asymptotic variances 112. 10\) and Ii3.2\) may be consistently estimated re- 
spectively by the sample variances of 

+ Wx@ n ,X lik ) + W 2 (9 n ,X 2:k ), k = 1, . . . ,n, (3.8) 

(9 n ,F ln (X 1>k ),F 2n (X 2ik )^ + Yi(0 n ,Xi >fe ) + Y 2 (9 n ,X 2tk ), k = l,...,n, (3.9) 
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as was done in 



Genest et al. 



§199 a ). Similarly, the parameter H in 112.9]) may be consistently 



estimated by the sample mean of 



m 171 [ ®ni Fln(Xi : k), F 2n {X- 



2,fc. 



k = 1 n. 



(3.10) 



4 Simulation results 

In this section, we present some simulation results aiming to illustrate the theoretical results 
of 13.11 These results relate to the accuracy of the approximations of the laws of the test 
statistics T n , by the limit law given in Theorem 13. II As an example, we consider the Clayton 
family of copulas 

C ( Ul ,u 2 ) := (uf-u^- l)~ 1/<? ; for 0e[O,oo). (4.1) 

The case of the test statistic T„ based on the x 2 -divergence. Under the null hypothesis of 
independence of margins (here 9t = @o '■= 0), Theorem 13.11 shows that the asymptotic law of 
the proposed test statistic T n is a xt 1 &W with one degree of freedom; see, e.g., Theorem 13. II 
We have illustrated this convergence by simulation: in Figure 1 and Figure 2 are plotted the 
distribution function of the limit law xt, an d the distribution functions of the statistic T n 
obtained from 500 independent runs for sample sizes n = 25 and n = 100. We observe that 
the approximation is also good even for moderate sample sizes (see Figure 1 and Figure 2). 
The integral in the expression of m{6, ■, ■) is calculated by the Monte Carlo method, and the 
supremum in (13.1 1) is considered on the extended space 9 e ; the supremum in (13 .ip has been 
computed on a neighborhood of 6q := by the Newton- Raphson algorithm taking 6$ := 
as an initial point. 



5 Conclusion 

We have presented a new approach based on x 2 -divergence and duality technique in the 
framework of parametric copulas. It seems that the procedure introduced here is particularly 
well adapted to the boundary problem. 
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Copule de Clayton. n=25 T rep=500. 




x 



Figure 1: Simulated distribution function of the statistic T n and its limit law x\- (Sample 
size 25, 500 replications.) 



Copule de Clayton. n=100, rep=500. 




x 



Figure 2: Simulated distribution function of the statistic T n and its limit law x\- (Sample 
size 100, 500 replications.) 

6 Appendix 

First we give a technical Lemma which we will use to prove our results. 

Lemma 6.1 Let F have a continuous margins and let C have continuous partial derivatives. 
Assume that j is a continuous function, with bounded variation on I. Then 

Jj(u h u 2 ) d(C n ( Ul ,u 2 ) - C(ui,u 2 )) = O (n-^Qoglogn) 1 / 2 ) (a.s.). (6.1) 
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Proof of Lemma 16.11 Recall that the modified e mpirical copula CL , is slightly different 
from the empirical copula C n (ui,u 2 ), introduced by iDeheuveld (jl979al ). and defined by 

C n (u l ,u 2 ) = F n (F£(u 1 ),F£(u 2 )) for (ui, u 2 ) G (0, l) 2 , (6.2) 

where F t ~ n (•) and F 2 ~ (•) denote the empirical quantile functions, associated with F ln (xi) = 
F„(xi,oo) and -F 2n (x 2 ) = F n (oo,x 2 ), respectively, and defined by 



Fr\t) := inf {x G R | F jn (x) > t}, j = 1, 2. 



(6.3) 



Here, F n (-,-) denotes the joint empirical distribution function, associated with the sample 
{(X lk , X 2k ); k = 1, . . . , n}, defined by 



1 n 

F n (x 1 ,x 2 ) = - y~]t 

n L — ' 



oo < Xi, x 2 < oo. 



(6.4) 



k=l 



We know that C n and C n coincide on the grid {(i/n,j/n) , 1 < i < j < n} . The subtle 
difference lies in the fact that C n is left- continuous with right-hand limits, whereas C n on 
the other hand is right continuous with left-hand limits. The difference between <C n and C n , 
however, is small 

z-1 j — I 



sup |C„(u) — C n (u)\ < max 

uG-f l<ij<n 



Using integration by parts, as in 



Cn ( - 



n n 



Fermanian et al 



C, 



n 



n 



2 

< -. 

n 



(6.5) 



(j2004al ). we see that 



j(mi,m 2 ) d(C n - C){ui,u 2 ) = j Vn(C n - C)(u 1 ,u 2 ) dj{ux,u 2 ) 
v/n(C n -C)(ui,l) dj(u x ,u 2 ) - / v/n(C n - C)(l,u 2 ) dj(u 1 ,u 2 ) 



Hence, 



[0,1] 



n / j{u u u 2 ) d{C n - C){ux,u 2 ) 



y/n(C n (ui, 1) - ui) dj(ui, 1) - / x/^CCn^,"^) - u 2 ) dj(l,u 2 ) 



[0,1] 



<5v^sup|(C„,-C)(u)| / d\j(u) 

, u£l . . . J I 



From this and (16.51) . applying Theorem 3.1 in iDeheuveld (Il979al ). we obtain the following 
result 

r j( Ul ,u 2 ) d(C n ~C)(u 1 ,u 2 ) = 0(n- 1 / 2 (\og\ogn) 1 / 2 ) (a.s.). 
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Proof of Theorem 12.11 (a) Under the assumption (C.l), a straightforward calculus yields 



J -§gm(9 T ,u 1 ,u 2 ) dC 9T (u 1 ,u 2 ) = 0. 



(6.6) 



Under the assumptions (C.l), and by applying Proposition 2.2 in 
can see that, as n — > oo, 



Genest et al. 



jl995[ ). we 



-^m(9 T ,u 1 ,u 2 ] 



dC n (u 1 ,u 2 ) -> y -§grn(9 T ,u 1 ,u 2 ) dCe T (u 1 , u 2 ) = 0, 



(6.7) 



and 



y -^m(9 T ,ui,u 2 ) dC n (u 1 ,u 2 ) -> y j^rn(9 T ,ui,u 2 ) dCe T (u 1 ,u 2 ) := -H < 0, (6.8) 

almost surely. Now, for any 9 = 9? + vn -1 ^ 3 , with |u| < 1, consider a Taylor expansion of 
/ m(9,ui,u 2 ) dC n (ui,u 2 ) in 6 1 in a neighborhood of 6>t, using (C.l) part 2, one finds 



n / m(9,ui,u 2 ) dC n (u 1: u 2 ) — n / m(9 T ,ux,u 2 ) dC n {ui,u 2 ) 



(6.9) 



2/3 v / jjnm(9 T ,ui,u 2 ) dC n (u 1: u 2 ) +n 1/3 — / ^Lm(9 T ,Ui,u 2 ) dC n (u h u 2 ) + 0(1) (a.s.) 



: n - 

• yy — y- j. ■, ~±, v - j. , . ■- „ I 

'/ Z J/ 

uniformly in t> with \v\ < 1. On the other hand, under condition (C.2), by Lemma [6.11 we 
have 

y ■^m(9 T ,u 1 ,u 2 )dC n (ui,u 2 ) = 0(n~ 1/2 (\og\ogn) 1/2 ) (a.s.). 
Therefore, using (16. 8p and (16. 9p . we obtain for any 9 = 9t + vn~ 1 ^ 3 with \v\ = 1, 



(6.10) 



n I m(9,ui,u 2 ) dC n (ui,u 2 ) - n J m(9 T ,u u u 2 ) dC n (ui,u 2 ) 
0(n 1/6 (loglogn) 1/2 ) - 2~ 1 En 1/3 + 0(1) (a.s.). 



Observe that the right-hand side vanishes when 9 = 9t, and that the left-hand side, 
by (16. 8p . becomes negative for all n sufficiently large. Thus, by the continuity of 9 i— > 
J m(9,Ui,u 2 ) dC n (ui,u 2 ), it holds that as n — * oo, with probability one, 



9^1 m(9,ui,u 2 ) dC n (ui,u 2 ) 



reaches its maximum value at some point 9 n in the interior of the interval B. Therefore, the 



estimate 9 n satisfies 



J^m(9 n ,u u u 2 )dC n (u 1 ,u 2 ) = 0md \9 n ~9 T \=0(n-^ 3 ). (6.11) 
(b) Making use of the first part of Theorem 12.11 and once more, by a Taylor expansion of 



^m(9 n ,ui,u 2 ) dC n (ux,u 2 ) 



with respect to 9 n , in the neighborhood of 9t, we obtain that 



= I jk m (8n,ux,u 2 ) dC n (ui,u 2 ) = J ^ s m(9 T ,ux,u 2 ) dC n (u 1 ,u 2 ) 

+ (9 n ~ T ) J j^m{9 T , Mi, u 2 ) dC n {u u u 2 ) + o(n~ 1/2 ). 



Hence, 



V^(0n - Br) 



-i 



jLm{9 T) Ul , u 2 ) dC n ( Ul ,u 2 ) V^W n (9 T ) + o(l), (6.12) 



where 



W n {9 



r) := i ■§ s m(9 T ,u 1 ,u 2 ) dC n {u u u 2 ). 



Tsukaharal (120051 ) . under assumptions (C.l) part 1 and 



Applying Proposition 3 page 362 in 
(C.3), we have 

V^W n (9 T ) ^>iV(0,£ 2 ). (6.13) 

Finally, by combining (16.131) and (16. 8p in connection with Slutsky's Theorem, we conclude 
that 

V^(0 n - 9 T ) -> N(0, £ 2 /~ 2 ). (6.14) 
This completes the proof. ■ 

Proof of Theorem 13.11 Assume that 9 T = 9 . From (16.121) . using (I6.8P and (16 . 1 3j) . we 

obtain 



1 



V^[0n-9 T )= --V^W n (9 T ) + o P (l). 



(6.15) 
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Expanding in Taylor series T n (9 ,9 n ) in n around Ot, we get 



T n (9 , 9 n ) = 2nW n (9 T )(9 n - 9 T ) - Y,n{9 n - 9 T f + o P (l). 



(6.16) 



Now, use ( I6.15P combined with (I6.16P to obtain 



T n (9 ,9 n ) = ^nW n (9 T ) 2 + o P (l). 



(6.17) 



By (16.1 3D . we have 



\faW n (0 T ) ^iV(0,£ 2 ) (6.18) 
in distribution. When 9t = On , under Assumption (C .4), we can see that S 2 in flBTTBl is equal 



to 1/5; see Proposition 2.2 in 
that 

under the null hypothesis 7i - 



Genest et al. 



(119951 ). Combining this with (16.171) to conclude 



1 n Xli 



Proof of Theorem 13.21 Rewriting ^ as 



T n (6> , n ) 
2n 



sup / m(0,u 1 ,u 2 ) dC n (ui,u 2 ), 
eee e J i 



(6.19) 



and making use of a Taylor expansion of Tn( ^°' en) , with respect to n , in a neighborhood of 



Ot, under (C.5) to obtain 

T„,(#o? T , 



2n 



m(0 T ,Ui,u 2 ) dC n (ui,u 2 ) + o P (n 1/2 ) 



Hence, one finds 
'T 

- 1 - n. 



» I ^-X 2 (0 ,9 T ) 



/n^y m(9 T ,ui,u 2 ) dC n - j m(9 T ,ui,u 2 ) dC 6r 
+o P (l). 



Final 



(120051) 



y, under condition (C.6), application once more of Proposition 3 page 362 in iTsukahara 
concludes the proof. ■ 
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